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This paper studies the relationship ol Artificial Intelligence to the 
study ol language and the representation of the underlying 
knowledge which supports the comprehension process. It develops 
the view that Intelligence is based on the ability to use large 
amounts at diverge kinds of knowledge in procedural ways, rather 
than on the possession of a tew general and uniform principles. 
The paper also provides a unifying thread to a variety of recent 
approaches to natural language comprehension. We condude with 
a brief discussion of how Artificial inlel5igence may have a radical 
impact on education it the principles which it utilises to explore the 
representation arsd use ef knowledge are made available to the 
student to use in his own learning experiences. 



This paper is a revised version of an earlier document written with 
- Marvin Minsky. Many of the ideas in this paper owe much to 
Minsky T s thoughtful critique; the authors, however, take 
responsibility for the organisation and wording oi this document. 
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flrt-ilLeial Intelligence. Language and Hie Study ol Knowledge 

Artificial Intelligence (AI) represents a new style of thinking about cognition thai is having an 
important impact on [tic study of language. The goals of Che e«ay ars 

- to convey a sense of the intellectual personality of Artificial Intelligence; and 

- to identify I0m« trends in its development which we see at especially relevant to- the study 

of language and. of the TOk of an AE approach to language in education. 



1» firtilicial Intelligence as the Study ol TnbeLliqer^ Pr&eesscs 

Before we embark Dn the substance of [his essay, it is worthwhile (o clarify a. potential source 
□f confusion. For many, AI is identified as i narrowly focussed field directed towards the goal: of 
programing computers in iuch a fashion that the) 1 acquire the appearance of intelligence. Thus it 
may seem paradoxical that researchers in [he field have anything to say about the situ dure of 
human language or related issues in education. However, the above description is misleading. It 
correctly delineates: the major methodology of [he science, that is, the use of computers [q build 
precise models Of cognitive theonet. Bui it mistakenly identifies this as the only purpose of the 
field. Although there ii much practical good ?hat can come of more intelligent machines, the 
fundamental theoretical goal of the discipline is understanding intelligent processes tndt pmdent of 
their particular physical realisation. 
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Naturally, understanding mteJligence abstractly has intimate interactions with understanding it 
in 1» natural form. Thus, AT and psychology luve dose ties. Indeed, Al has been called 
"thEQ ret Leal p J y C h-0 1 Og y" (Newell, 35 /J). AI draivs from psychology a SM of basic concern* - 
understanding language, perception, memory, problem solving. Psychology, In turn, acquires a rev 
framework of computational ideas fOT expressing co^nirive [henries. Indeed, a major purpose of 
psychology becomes ;he dlKOVery not of the ciass of program* that could possibly explain a given 
imellig-enc behavior (thii is the archetypical concern of A[} f but rather the particular program that 
a gjven individual actually possesses 



LI A Knowledge-Oriented rather than Power- Oriented Theory of IntelliEencs 

Given the general perception of Al as the computational study of intelligence, it srill remains. " 
10 identify the particular set of questions and theoretical constructs rhat a researcher in the field 
bring! to bear on cognitive problems. En the earliest years of the field (i.e. during the Late fifties 
and early sixties), the hope was that :s few simple but very powerful technique! could be identified 
that together could be used re create intelligent program*. During this period, theorem proving 
and search played a primary role. Indeed, to many A] has become identified as the science of 
heuristic starch that Ji, The study of procedural techniques for exploring state spaces too Large to be 
explored in an exhaustive fashion. During these early years, an AI paper cm language might have 
begun by representing the state space by means of a formal grammar and then have fOcUSScd on 
the issue Of achieving successful parsings in reasonable times. (Green, ]S63i Lindsay. 1*63). 

Today there has been a shift in paradigm. The fundamental problem of understanding 
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intelligence If not -the idennfucaiton of a few powerful Hxhniquei, but rather the question of how 
to represenr large amunti afknxulertgt in a fathOm ihat permit thttr tfftttive use and fatiTaciion. 
Thii ihift n based on a decad* or experience with programs [hat relied on uniform seonch or 
logistic techniques that proved to be hopelessly inefficient when faced with complex problem* 
embedded in large knowledge spaces. The current point of View ii [hat the problem solver 
(whether man or machine) must know explicitly how to use its knowledge — with general 
technique* supplemented, by domain specific pragmatic knowhow. Thus, we Jtfe AI as havjng 
lb If ted from a powtr-bastd strategy lot achieving intelligence to a kiutorhdgt-baicd approach, 
Minsky and Paper; characterize ihese two pomes of «irw as. follows: 

The PrjUEfl 1 strategy seeks a generalized increase in 
computational pewir,; ]t may look tOtjare" n«u kinds of computers 
{•parallel" Or " fuzzy" or "attoclst ive* or whatever H op it may 
loofc tu+jard en tensions of deouc-five hengra I i tu. T or information 
-e:-eval, or search algorl thm* - things like better "resolution" 
methods, fcettor nethoila tor- exploring trees and net** haah-tooied 
triplets* etc In each cage fh* Inprevaient sought is intended' 
to be "uniform" — intlepundent nf thd particular data bass. 

The <\QLJLE3GE strategy sees progress as coming fro* better 
uau» to express, recognize, and use Ctivtr SB and particular *er*i| 
Of knouJedge, This thecry sees the problerc as ep i stereo I ogi eal 
rather than as. a matter of computational pDi*er or mathematical 

■ 

generality. It supposes, far example, that Mhin a ecientist 
soIvbb a neu nrooiem,. he engages a highly organized structure of 

p 

especially aporDC^'ate fscta h *indeis d analogies, planning 
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Mechanisms. Mtf-d!i scipl ins procedures, etc. To be sure, he a»so 
engages "general" prob I eii-*c Iving achen^va ObS i t i S Oy no ■jeans 
Coitus that very smart people are that way directly because pf 
the superior power of their general method a --at compared ul th 
average people. Indirectly, perhaps, c>ut that 1b another (natter: 
a very intelligent person night be that uay because of specific 
zzj. features of hi* knauledse organ iziiy ^nc-HleOga ra!h*r tha.n 
becauae of global qualities of his *th5nkihg H which, except for 
the effects of hue aal f-app| ied knowledge, miflht be litt In- 
different fron a child's. 

- 

«rtinsky T Paper t, 197*« p. S3] 

I[ ii vrarthwhjl* to observe here chat the goali of a knowledge-based approach to AT are 
closely akin to those which motivaied Pia E et eq h u his research center "Centre dTpUttmoLogie 
Cenctique"; or, more precisely, re his reason for calling himself an "tpistemciogist" rather than a 
psychologist. The common theme ji rhe view that the process of intelligence is determined by the 
knowledge held by the subject. The deep and primary question* are to understand ?he operations 
*ftd data structures involved. The physical (biological or electrical) mechanism* underlying them 
ar# not wen by us or by Ptaget as. che Source of intelligence in any structural sense <a It hough 
intelligence could not happen without some such embodiment.} 

Perhaps the knowledge paradigm Ii matt clearly characterized by a developmental perspective. 
Naturally, intelligence is im possible without a certain basic "hardware" capability. But our position 
and Placet's is that this capability is already present in th* very young child- The subsequent 
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development of intelU^enee li not due to the emergence of new biological capabilities in the 
individual Rather, increased performance is due to the acquisition of knowledge. This knowledge 
can range from specific facts about a domain to general problem soling schemata. Indeed, II may 
even be knowledge about how to team (far esample, facts about debugging techniques). Some 
fao* resuft in only local improvements m the individual's capabilities, white others may have 
global effect giving rise to a stage transition in PLagetian terms. From the computer standpoint, 
we see the need for some improvement j in jpeed and memory capabilities especially for such 
special purpose applications 44 vision and speech. But the fundamental difficulty racing 
researchers in the field today are not limitations due to hardware, hut rather questions about how 
to represent large amounts of knowledge in ways that Still atlow the effective use of individual 
fa.fiS. 



1-g Artificial Intelligence as a Procedural Theory of Knawlatfefr 

The reader may perhaps wonder at this point how Al k to have an identity of its own ir the 
knowledge strategy is pursued. Lt if U asks only for explicit representations of the knowledge of 
particular domains. Isn't this, simply the concern of the specialized fieW of study associated wuh 
that domain? How does this notion of Al. for example, distinguish it from linguistics when the 
domain is that of language competence? 

_ 

The answer lies in identifying two concerns of At which are uniquely praadnral in nature. 
The first is that the knowledge of a particular domain must be repiesemed in a form that is 

useable. A linguist may content himself w a (h the specification of some attribute of grammatical 
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knowledge by defining the appropriate transformation: an Al researcher must go one step further 
by asking how this transformation an be woven mto the m*| * f b of linguistic information m 
such a way that the comprehension or generation pracca is possible in reasonable computational 
times. Thus, AI brings to the traditional itud? of a g^en fiebd the quest for identifying and 
formalizing pngwatk knowledge, namely knowledge of "how" rraditional facts are to be used. 

There n a second concern which distinguishes Al from traditional studies of a given domain, 
This is its concern for identifying the manner in which competence, in different domain* inset 
Again drawing upon an example from ths Kudy of language., a grammarian can legitmaiely f< 
on attributes, of grammar divorced from the semantics or use of language. The essential 
of AI, however, is to conduce fVamewoiks in which ali of these divers* kinds of knowledge 
successfully interact in order to alio* the complete comprehension process to take place. Thus, both 
concerns which make AT unique arise i rorft its responsibility to structure knowledge into procedural 
system* That can actuate soive problem*. 



:ract. 



locus 



Con tern 



■Ldn 



LB Organization of this Paper 

+ 

Our goal in the remainder of this paper wilt be to provide the reader with insight into the 
structure of theories cunwily being formulated in Al within this knowledg=-b«ed paradigm, 
Section* 2 and 9 illustrate the A] approach by dicing a particular example -- frame theory -- 
which many people m the field are currently expl^ng and fmd quite earing. Section 2 
introduces the subject by approaching frame theory as an Al extension of the traditional linguistic 
notion of «*e grammars from a narrowly syntactic theory to a general representation for' 
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knowledge- Section 3 discusses the wloui attributes of At theories that frame* exemplify. In 
lections 4 and &. we deepen our view of Al at a computational style of epi&tem&logy by discussing 
two aspects of cognition and language to which Al has nude unique contributions -- the role of 
self-knowledge and the Importance of control structure. Section 6 concludes our theoretical 
discussion by briefly viewing in perspective the imelfectual history of the field. 

J 

These topics do noc by any means- exhaust the contribution of Al to the study of language. 
But we believe they provide an excellent illustration of the Al methodology. And (hat items to u$ 
the only serious purpose to be served by a paper of this length that is accessible both to students 
of the field and to non -specialists. Two companion essays to this paper were prepared for the 
National Institute of Education by VVinagrad (l$%) and Wilk* (1976) and compiemem the 
discussion presented Here, 

Throughout the paper, we avoid ea tensive excursions into formalism, and jargon, More 
seriously, we Also ignore, or pass, over in a cursory way, a variety of serious technical issues. We do 
this In order to focus auenti&n on the overall enterprise of a computation:*] investigation of 
knowledge. We feel especially justified because of the tendency in the literature to fwu& on 
narrow techical issues, Nevertheless, to compensate for this broad focus, we provide in section 6.2 
a hit of the technical questions which we feel are Critical for the reader who wishes to pursue this 
point of view more deeply. Also* the reader should rewgniie that the various research we point to 
AS illustrating our position have many important differences H&Wever. we feel thAt the critical 
need at this time is to develop a feeling for the paradigm which character's Al as a method for 
Investigating knowledge and language, and it is to thii goal that we direct our efforts. Finally, we 
and Others in the Al community expect to develop further the ideas raised m this paper, The 
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interested reader ii invited to writ* us for rtfertncei to the latesr research in [his area. 



1.4 Educational Applications 

The final part of the e«a;f (section 7) will touch schematically dji some of The consequences for 
education Of Artificial Intelligence jeseardh On natural language. Their consequence.! are of very 
different sorti. Some of them, no doubt for most people the most obviOUJ, are consequence* of 
actually possessing machines capable of undemanding what a student taps. Obviously such 
machine* would greatly improve the quality of Tomputer Aided Instruction" {in the sense or 
tutorial programs, programmed instruction acmirmtejrd by computers, "drill and practice", 
Computerized advice, etc) 

Much deeper consequences, however, are related co th-e intellectual content of Artificial 
Intelligence research on natural lati|uage. As a new kind of cognitive theory it his obvious 
implications !"or the education. It is our personal opinion that these consequences will very deeply 
transform the theory and practise Of education, Although we Will not try to argue for this position 
at length, we shall cry to convey the flavor of the perceptions an which It is based. 
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3, Frames as a Undying Kp praach to Reprcaenfina Knowledg e ■ 

Frame Theory ii an A| approach m rtprwnun^ knowing, (bai generalize* the lin|uist.c 
notion of case grammars in * manner that allowj the repr« cn t at ton not only of syntactic 
fcnowiedge, but abo procedural knowledge for how to construe * parse as well at related semantic, 
contextual and thematic facts. Historically, frame theory is fas*! upon nwe than purely linguistic 
consideration*. In the Original formulation by Mimky <i9ty, the notions of frames and Trame 
iystems were developed in equat measure from problems arising from the construction of adequate 
theories for vision, for memory, for logical reasoning and for the comprehension of language. 
However, we shall narrow our focus in this section id confer the utility of frames solely in 
relation to linguistic knowledge ] n section 3, we iKp back fror> ihe realm of lang uage to consider 
a Variety of larger issues that characieriie the At approach to representing knowledge. 

■ 

2.1 Case Grammars 

The ca« frame approach to linguistics as developed by Fillmore 0966) and others constrain* 
the enquiry to the realm of syntactic knowledge. Withm this sphere, the bas.t idea U to organii* 
structural knowledge of the sentence around the verb,' The result is a „» grammar which 
S p=cifiei for each verb the cases which it requires. To illustrate this, we use the traditional 
linguistic devi« of choosing a repress t.ve smtence around which to or^nite our observation* 

($1) John tickled the girl with the feather. 
Following Chafes formulation, a case-oriented approach would, represent SL as follows- 

Tieklet agent John 

patient the girl 
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2.2 Understanding a Sentence 



To make trie linguistic enterprise tractable, the linguist tin ignore the issue of understanding 

. an utterance beyand the narrow concert rf jdencif^Lnj iti sjmtattk structure. But AI is forced to 

address thii Larger issue if computational modeli are to t» deigned thai actual!, act appropriate 

in response to the utterance Th«s> AI a willing to consider the semes of understanding SI 

llluilralrd by the following three listeners: 

A five year old child, who has a dear image of John using the feaiher to 
evoke giggles. 

A slightly more sophisticated Jtstener capable of taking account of i context 
iuch as: "Atl the gjrls in the room had orange hats- One was decorated with a. 
large feather,, another «vith a. towering confection of artificial fruit. JoAn 
ttehltd (Ae girl u>lth th« fptfXtr' 

A linguist who understands the sentence as an example of a well known 

• ambiguity m parsing, 

It is obvious that all the subjects use some grammatical knowledge. For example the order of 
the words tells them that it was John who did the luting and not the girl. But grammatical 
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knowledge alone is not ajfficLm* even to account for the syntactic parsing of the sentence, 1 e. the 
Interpretation of the phrase "with the feather" as adjectival or adverbial This pemt 11 made clear 
by cone ra it i rig the sen^ricF *ith S? 

(S2) John tickled the girl with the funny nose. 
Even our Ti^ year eld would net be tikel, to read thit as if John uud the nose as a cicMmg 
initrument. The difference h not punling to common sense: after all the ch.id knows that 
feathers are good r or tickling whik n«« are more likely to be used for celling or as personal 
identifier*. But the difference jhould trouble the cognitive :he?nrt «ho know* how to describe 
ayntactic structures in fine detail but has no better description than naive common sense of the 
child's knowledge about feathers and noses. And the difference is relentlessly present for the 
computer-oriented theorist who absolutely needs 19 find wayi to formalize it. 



2,3 The Procedural Idea of a Frame-Keeper 

Let us begin our generalJjail&n f a se grammars by discussing informal])- how a procedural 
case theory mi^h; be MganiHd. Following this, we v,,|| discuss work by R.esbeck dm) that 
represents one formal realization of this point of view, 

In cur informal mini-theory there it associated with each important word in the sentence an 
active agent wh,ch is smctlj defined as a computational procedure but which we might juit ai *eii 
anthropomorphic (for the moment anyway) u a person. Let's rail them FrcmfKrtt^rj for 
reasons which will soon become apparent. The procwj of understanding the sentence w t |l consist 
of an interaction between these Frame-Keep^ wherein each tries to fulfill certain needs. For 
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example the needs dF the TlCKLI-FR AME keeper include: 
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Find the agent (the person responsible for the tickling]! 

Find the pa'.Lcnt [the person being uckledj! 

Find the instrument (the means by which the agent is [icillng the patient)' 

So the TICKLE-FRAME keeper Ii^h active^ seeing l0 m these slots written into hit 

definition. If all of them can do so to their satisfaction (an idea which toe wj[| pursue shortlyj the 
sentence is to be considered aj parsed, 

The essence of the AI extension of the case frame idea js that the frame for the verb contain 
not only Syntactic information abou; :hc expected location of [he underlying Cases, e.g. in a 
declarative sentence :hat the subject is the agent, the object the pattern and the - w , rr f prepositional 
phrase the instrument, hut also semantic and pragmatic facts. Thus, we imagine that for our 
tickle-fume, the Frame-keeper has access to world knowledge that nates that Feathers are typical 
tickling instruments, while other kinds of objects, for example, dumbbells, are not. Hence, it would 
be the tttkicfTcmt U?ptr that would b* in a position to reject for 52 that the "with 1 phrase refers 
to the tickle instrument because it knows that one generally does not tickle with nosei 

In section 2*. we discuss in existing program that associates both syntactic and semantic 
knowledge with ind,vjdual words in order to comprehend the sentence. Jn secnon 2.5, we indicate 
how the frame idea can be extended to nouns. Given that extension, the process of 
comprehending S2 becomes clearer as not only does the tickle frame reject the nose phrase, but the 
*ir] frame accepts it as a legitimate descriptor of people. Thus, the system ha, two source* of 



Goldstein, Papert IS Knowledge, Language and At 

■ 

evidence both supporting one another regarding the intended parse. 



2.4 A Formal Raalizaticn of the Frame Idea 

Our discuuion of [he TtCKLE-FRAME and frame-keepers represents the kind of concern 
for fldtotf tnoiw/ftJf<? that AI brings tg lingu LSLiC studies. C. Riesbetk (l$H L9/5) has designed and 
implemented a program that, although not f rime-based, shares many important prcpejbes. with the 
approach outlined above, Riesbeck's parser maps natural language mio The Concept us] 
Dependency structures deigned by $cha.n* (1973). SJ is typical of the sentences that Riesbecli's 
system analyzes: ', 

(S3) John gave Mary a beating 
Prececal rai in Forma Uon J J associate wj;h each word in the sentence, which is triggered when the 
parser encounters the word in its le:'KO<nghl analysis. This Information cgnsrructs an instantiated 
ease-frame-like structure. It also contains expectations that serve to disambiguate multiple ward 
ienses and properly decode the syntactic cues that indicate the filler* for each slot in the case-frame 
associated with the Verb. A trace of the a na lyiLs of the parser is given in figure 1 r 

- 

The iystem, called MARGIE, of which Rebeck's parser was a part, exhibited iti 

understanding by generating paraphrases of the original sentence and by mating deductions based 
on it (Sr.ha.nlt, 19T3X. The basic operation was to comprehend the sentence by mappLng it imp the 
conceptual dependency representation developed by Schank (19V J). Figure 2 illustrate* the 
conceptual dependency diagram generated for the above sentence. The details, of Scbank's. " 
representation are not important here, ex«pf to note that M ARGIE understood the sentence as 
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John 



t gave 



1-1k there tin 
NP? 



L-ia there 
HP? 



.-in 



i-ic the current 
NP a hunan? 
3-le the cuxratic 
HP an object? 
4--1& ttie current 
KF in action? 

5- true 



REQUESTS TR I GCIJU.I) 
IWfH 



Mary 



t-is the ctirtent 
NP a huoaa? 
J— is Eh* current 
NP an object? 
4-is the current 
JfT m action? 



6 



bcatlnft 



J- is the cut rent 
NP an object? 
ft-ii th* current 
NP an d^Lion? 
■fr-crue 



peripd 



I IUI Id 



7-does the tur- 
rent word end 
an HPT 



7-dnes tile cur- 
rent trafd end an 
KP? 



fli-nfc 



3- la the currant 

HP an object? 
4-ift the current 
HP an action? 



ACTIONS TAKZW 



Biihjeet oJ the verb 
ta follow 



it-B'jme the word 
"to," if It appear* > 
Introduces che re- 
cipient af che 
"siwina" 



assuoe tiflry is the 

recipient of the 
"giving 1 ' 



save the current 
list at reduecta and 
replace it Wtthi 
7-dgeg the current 
"W-d end an HP? 



build the NP "a 
beating" and reset 
the list oE request* 



assume the NT ,ir-: cui 
is ttie Haiti action 
of the clause, the 
Subject {John} la 
the actgr find the 
recipient (Mary) is 
tile Object 



Fipira 1 
Traca of the HU&hecfc Parser on "John gave Kflry a beating. 
Ftani (Hieabeet 1974. p. 31 j 
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REFIaTLLO 



HA3I.Y £- 



i 



.nlHH 

3p 
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JUHD (JOHN) 
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Figur* 2 

ConceptiLaL Psjiendertcjr fccprtteen mc i™ of the neanin e i?f the atTitcncaj 
"John £4*e Kary a beating." 

FtOtl (fcieEbeck JS74. P. Ifrfl), 
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referring 10 a series of action* by John involving the motion of his hand, Riesbect's parser has 
not erroneously interpreted *give H to mean that John is transfer^ the possesion of something to 
Mary. 



2,5 Frames far Represent! nit Kn&wlcdfi g about Nmm B 

The caw frame Idea in its ordinal linguistic form was on en ted Awards representing 
knowledge about verbs, AI pursuing the goal or comprehensive theories of knowledge has 
generalized the idea to allow frames for nouni. 

Consider again the sentence "John tickled the girl with the feather" We imagine that pan of 
the knowledge for properly understanding this sentence Ji that the verb-frame for tickle has as a 
typaat mstrument "feathers.' Such verb-cantered knowledge ii supplemented by framei for such 
noun* as "girl' which Contain pointers to sub-frames whjth describe typical appearand, purposes. 
Mcivities, etc. The appearance sub-frames would indicate that girls do not "have 14 feathers. They 
might under closer scrutiny indicate thai "feather* - are a possible but unlikely kind of dt 
However, the strong default for H feath*ri*as a typical nckiing instrument wmjfc predomin; 
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A related example ihowi where noun-frame information might dominate the verb-frame 
default. Consider fh« three sentences: 



i- 



(54) | frightened the dug- with the loud noise, 

(55) I frightened the dog >*ith the brown fur. 
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{£6) I frightened the dog With the lend birk. 

We believe most readers would understand Si by assuming thai the 'loud noise" was (he 

instrument of 7r2gh[en H . This would be because of both the Verb-frame knowledge that loud 
noises often can be used to frighten and the noun -J ram* knowledge associated with dogs that 
indicate that "noise" i« not "possessed* by Iherri. On the Other hand, in 55. the noun-frarme 
knowledge for dogs would surely indicate that they typically have "fur", her>ce the phrase "with the 
brawn fur' would be understood to be modifying the noun "dog". In £6, we suspect that cur 
population of leaders would be divided, depending on whether their frame for "frightening* 
contained the typical instrument of emitting a loud noise, describable as i bark, with a stronger 
emphasis than their frame for *dogs" contained the default property of emitting noise by barking, 
(This, extension of frames- to nouns has not yet reached the point of development in the form of 
programs. However, a. recent thesis by C. Drake OMD extendi ihe details of the theory beyond the 
outline provided by Minsky to a form that is closer to the implementation stare.) 

In the previous paragraph, we wrote as If the tickle frame could have dtreci. explicit 
knowledge about feathers. A fundamental and pervasive das* of problems concerns the relative 
feasibilty of achieving this as compared to various schews for indtrta reference io feathers via 
some svptr-itt en- description. For example, (he tickle frame might kho* that "long, soft objects" 
are good instruments for tickling and Somehow use this as a means of finding a connection to 
feathers. Mos; early workers in Al (like most COmempOrary psychologists) were Strongly averse to 
the direct approach and so a great deal of attention has been given to the invention of "'search" or 
"information retrieval" methods to mediate the indirect approach. Current thinking (especially 
amongst frame-theorists) Jeans towards very much more direct knowledge than was prevlouily 
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considered feaiibl*. This places a hea^y demand on n\t*wy, and less on retrieval mechanisms. 
Thii appear! to jome so be "heavy-handed" for, we do know that human memory has a, very 
large capacity and no one has been able to propose very sophisticated retrieval mechanisms. So 
perhaps it is nature (rather than the theorists) who has adopted She "heavy handed' solution. 



2.6 Extending thejrame idea to a Theory of Context 

Procedural frames which are nor restricted tn verbi are powerful lools for dealing with the set 
of problems traditionally called "context", indeed, once we break away from the verb-centered 
sentence ai the unit for analysis, if becomes possible to tali about "scope" in a possibly very iar^e 
text. 

This too is a topic oF current research and there are few programs implemented aj yet. One 
of the syscemi that does exist, however is Schank's successor to the M ARGIZ system that generates 
paraphrase's of simple Stories {Schank, 1975), The esseniia] theoretical idea in the new system is the 
existence of frames [called scripts by Schank) that describe typical activities in terms of his 
Conceptual Dependency primitives. An example U the following informal sketch of a frame for 
g Qing to a restaurant: 



Script] restaurant ■ 

Hois: cuBttKr--, uaitra-as, chef, cashier 

Reason; to get tg$d so at tn gn doun in hurtfl*r and up in pleasure- 

Scans 1 Entering 

FTflANS - go in!o restaurant 
■ rffltMLD - find table 
PTftANS - B o to taale 
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tHjNE -sit dcnn 

Scene 2 Order log 

MRAN5- - receive- m»nu 
A" TEW: - ook al it 
MBU1L0 - decide on order 
WRAMS - tell order to waitress 

5cer.e 3 Eat ing 

A TRANS - rscoivB fae>d 
INCEST - eat food 

Scene 4 Editing 

H7RAN5 - ask for chack' 
ATRANS - give tip to uai tress 
P7RAN5 - qn to cashier 
ATRANS -give money to HBhiir 
PTRAN5- - go out of restaurant 

(Schank, 1575, p. (17) 



The image of understanding a sturf jj mapping the sentences of the itory into the actign* 
deitribed in the frame. Unstated fact* described in the frame iff assumed. Thw, mapping a 
story in which the sentence Appeared: "john left a tip* into this frame would alto* the system ro 
know that the "tip* was left for the wiJirea. a f act contained in the ffamc but not in the itory. 



2,7 Frames for Them at is Kr.ow i e-.r.^e 

■ 

Our previout examples haw* distuned eKteniions of the case frame idea to incorporate into 
the frame a larger fraction of the knowledge associated with comprehending a wruence than is 
included within the purely grammairtil realm. We would like to conclude by extending aur 
representation beyond the ScnlenCiaJ level to include the lundl of kno^lcd^e tha: hnl. leniences 
together. Thii represents the frontier of current research and sq we will ha*e few forma] example* 
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and have to content ourselves with some observe tiom which indicate the direction in which we 
expect this research ro go. 

To develop (hit notion of thematic frames let us wmiider the problem of undemanding a 
fairy tale. Our goal is to represent the knowledge tha; links sentences in a si ory together into a 
coherent whole. To begin, we will first examine some work by Rumelhart {1975) which address 
this question, but utilizes a phrase-structured grammar (augmented by semantic interpretation rule*) 
to represent, the structure of the srory. Typical of the rutei that his system possesses are: 

RULE L: STORY -> SETTING * EPISODE. 
The selling is a statement of the time and place of the story as weLl a.$ an introduction for the main 
characters, e.g. "Once upon a time, in a far away land, there lived a good king." ■ 

RUL£ 2; SETTING ~> (STATIVES)* 
Episodes conMiT of events followed by reactions: 

RULE 3; EPISODE -> EVENT ♦ REACTION 

An event is the mow general category of the grammar. 

RULE *. EVENT -> EPISODE or CHANGE-OF-STATE or ACTION 

Otr E VENTRE VENT 

A reaction consists of an internal and an overt response: 

RULE 5: REACTION ■> INTERNAL-RESPONSE ♦ OVERT-RESPONSE. 
There are a large number of in tern a ]■ responses. Two are 

RULE 6: INTERNAL-RESPONSE -> EMOTION or DESIRE. 
Ovm-responies can be either an action or a utit% of attempts: 

RULE 1 OVERT-RESPONSE -> ACTION dt (ATTEMPT} 5 
Attempts also have an interna] itructurr. ' 
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RULE 8c ATTEMPT -> PLAN * APPLICATION. 
The plan ii developed to accomplish a d«jre, The application is the actual execution of [he plan. 

For each of these rules, there are semantic interpretation rule* that indicate for example :hsl 
rule 1 meani that both the setting and the episode must be present white rule 3 (with the same 
syntactic %' connective) indicates mat the event is [he HUM of the reaction and precede* the 
reaction in time- 
Let us now as* how the skeleton represented by this Story grammar might be extended into a 
frame sys-tern. The bask change Is the addition of large amounts of highly structured, and 
particular knowledge. This information which we shall call dtfavtt knovrferigr includes 
information about the various aspects of the story, e.g. setting, episodes, martial and external 
responses. Indeed, the proposal is that the system knows a few "favorite" stones in extensive detail. 
Thuj, with only a brief introduction to the story such as "Once upon a time and Eong ago", the 
listener is able to bring f& mind an elaborate letting corresponding io his primary default — 
perhaps the Camelot setting, the Black Forest letting, or Ihe humble cottage depending on the 
idiosyneracics of the frame system of the individual listener. Of course, conflicts will arise from 
difference between the story and the default, causing one of ti*o results: cither exceptions a;e 
entered into the evolving database representing the sterns comprehension of the story, or, if the 
conflict is too radical, the entire default jetting is replaced with another. In either case, the result is 
that far more detail than is actually stated in the story is assumed by the listener through this 
mechanism of default knowledge. 

Jua as there can be default knowledge about siaws such as setLing, so too can there be default 
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knowledge about the expected jequenw of episode* in the story, whet* each episode ii itself a 

frame (tg\ a frame for the trim] state -■ a damsel in distress, followed bv a sequence of frame* 
describing a Typical rescue operation by a handsome prince and hindered by a tricked witch}. Thts 
organiiat:on of frames into stale action sequence ii essentially the Script idea of Schank's 
discussed earlier. Default knowledge about espied theme?, nwak. narrative structures may *ha 
be included. We expect, for example, for a fable to have a mural, fw the prince and princess to 
live happily ever after in a children's fairytale (though not in the original primitive folktore). 

The punt of this discussion ii to artkubite this model of comprehension Jn which the process 
is not one of literally understanding the text, but instead is more one in which the tear triggers 
rich, highly structured knowledge packets (hat supplement the literal content, provide expects (toni 
for the remainder of the story and place the story in a. context of related knowledge. 
Understanding is seen as essentially a process of evoking and ?hen debugging existing knowledge 
packets. 
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3. issu es to rj&mpurahonal EpisreniPlo-ay 

At this point, we have seen how the lm &U L«k theory of caie gramme hai been extended by 
AI to b KDme a kncwkdge^rimiftl theory of language. We now step Oitk f rum a purely linguistic 
orientation in order to gain a perspective on the general theory uf intelligence and knowledge 
embedded m [his point of view. Specifically we shall examine: 

(1) the manner in which frames exhibit a readmeu to us? new. flexible and 

complex data types; 

<2) the fashion in which frame* represent a synthesis of procedural and 
declarative knowledge.; 

(3) the sense in which the um of an anthropomorphic metaphor has become a 
technical device in AI theory- 

(*) the trend towards the representation of knowledge in larger, more structured 
units (which frames, exemplify 

(Sj a concern for allowing useful interactions between diverse kindt of 
knowledge; 

(6) a desire for expertly controlled practises. 
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3,1 Representing Declarative Knowledfl a 

A central concern of AJ u the GorHtruHJon at new datatypes and the exploration of their 
capabilities. The fra^e idea i* jejIE evoking m AI, and *& it is dif ricult to give a precise 
definition. Miniky give* the following description of a frame in hu original paper on ;he subject: 

We can tnin* of a frane j& a network of nodes ancf relations. The 
"top leveJg M of a frame ar*. fixed^ and rspreaent thmgs that sr# 
aluatfg true about tn* eupposed situation. Tht lower levels have 
many terminals -- "3 lot*" that ausf be 'fitlorJ Oy tj>«c i f i c 
Instances or data, Each terminal can specify condition* its 
assignasnte must neet. [The assignments themselves ar» usually 
smaller "sub-franes. "J Sinple conditions er* specified by 
markers that might require a terminal assignment CO be a person, 

I 

an object of sufficient value, or a pointer to a sub-frane of 3 
certain type. More COHpla* condi t ions, can specify relations 
among the things assigned to. saver 3 F terminals. 

iriineky, 1375, p. 212) 

We iw from the above description that the traditional itolmltw data structure of e&jeeta, 
properties and relations plays an important lofc In the definition of a frame. This ajp«E of frame 
theory Has Us roots in earlier wort in A3 on the representation of knowledge ai attribute-value 
pairs (e,£. the BASEBALL program {Green, L9S&) and a> semantic new <Qu Lilian, 1968). The 
notion of a Semantic Nee has already had an important impact on cognitive psychology, cf work 
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by Anderson and &ower {\$1$} and Norman (1975). The additional ideas added by frame theory 
which generalize the notion of a Semarac Net are: 

(1) the organization of TheSc rciatjons, no; in;o a uniform net, but into a highly 
structured set of concern {the frames, themselves); 

(2) the inclusion of defaults for properties and relation! representing the 
expected values and supplying common seme knowledge about the content 
(possibly not supplied by the actual ItAl}. and 

&) The jfl fusion of procedural knowledge- which we shall diSCUii below. 



3.2 R&pre 9 anting Procedural Knowledge 

In sections 2.2 and ZX we saw jn formally how procedural expertise ran be- associated With a 
frame in Ihe farm of a. Fratite-Keefar, a procedure whose action u to examine [be curre-nt context 
for data that cm fill the slots of the frame. To further clarify the concept, we begin with a 
gedanken example of bow in AI program might emulate a certain aspect of the way a 
mathematician thints. The example goes right into the domain of the logician and proposes, a 
Very different way to cope with [he existence pf pai^uxej such as [he Burali-Forlt paradox m 
mathematics and such paradoxes in ordinary language aj the. use of self-reflexive assertions, such 
aj "All Cretans are liars* (spoken by a Cretan and meaning; Cretans never tell the truth). The 
traditional logician's recourse is to seek an axinmatmtion from which no s-wh contradictions can 
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be deduced.. Our theory of what an working mathematician actually does li Id jsi principles of 
reasoning whuch could easily lead to paradoxes Lf they were ;o be applied in That direction. But 
the mathematician knows this and refrain* from usjng them m directions Vnown to be dangerous. 
It u as if he posred a sent 17 near dangerous places in his intellectual terrain. The job of the sentry 
ii to issue warnings if any reasoning process comes ion close to the danger spot, In Ai we see as 
our job developing theories in which the concept of such 1 sentry an be easily formalized. 

The anthropomorphic idea of posting a sentry i; typical, we maintain, of the way in which 

sensible people have thought intuitively about this, kind of problem. The contribution of computer 
science ii to show how to translate this intuitpve. matter into a tech meat and well defined form. IF 
one represents 1 knowledge system as a set of propositions, the idea of introducing the sentry is 
outrageous, or at best a llteraiy conceit. But If thfl knowledge system is represented in the first 
place as a set of interactive processes, the sentry is merely one more process. Naturally one hai to 
be sure that chls new process is well defined. But that is a technical problem. The conceptual 
change is that it is now perfectly natural and coherent to introduce such entities. From this 
perspective,, a fundamental contribution of Al to eplHemology is made clear; and that Is. -.he 
systematic introduction of active agents tmo epistemologica.1 theory constructions, so that, for 
example, an item of knowledge, a concept, or whatever, is seen as an active agent rather than a 
passive manipulative object. <£ee Greif and Hewitt (1975) for recent work in this direction.) 

The gecanken estampLe from mathematics seemed to us wei: suned to illustrate the force of 
this concept in an intuitive way. But there ate real applications applied to quite real problems. A 
well known example as Chamialt's method for dealing with reference problems in children's stories. 
To illustrate this, consider the following story fragment from (Charnjak, 1572). 
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were 



Today was Jack'* birChday. p enn y and janec *ent to [he tfWfc They 
gpine to get presents. Janet decided » get 4 il[f - DonV do tlwt . aid Penny 
"Jack has a kite. He will make you rake « back* 

The gca| » to construct a Lheary ( hat explains how the reader understand Chat "it" refers to she 
ne» 1^ not che one >di already imttl Purely syntactic criteria (such « assigning the Cerent of 
"«■ to the last mentioned noun) are clearly inadequate, as the result would be to mistakenly 
underscand the last sentence of the story as meaning that jack will make jan« ca>e back the kite 

Resolving reference is a traditional problem of linguistics that cannrt be handled by purely 
*ynia«k techniques. Our solution will evolve the raUatJon of our intuitu Trame-keeper" as a 
prec.se procedural entity, the <f m . Thj* ,* a production-like entity tha: « atlIvated by lD ™ 
aspect of the sute of the computational -vprid ^chin winch it i? defined. But we witl poitpone the 
detail* of this solution until section 3.4 in order to first give «me perspective on the underlying Al 
philosophy. 



3.3 Representing Know ledge in Highly Structured Faekgia 

The fundamental Frame assumpslon Is the thesi* that four ttteattans tkttn m think sr* Wfj 
as ntu as w rtm*. Mo « si tu aciSrts m whJdl pfflpte rjrid thamdvs haV(? iljffkJem irt comni<>ft 
with previously encountered situation* for the altat futures to be pre-analyz* and stored In a 
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situation specific form. Whenever rhjj happens v*t are able re ute "pre-digeaed", highly particular 
knowkdge -- *n explanation in sharp contrast with the view* of (nidligence propounded by 
theorists who would prefer Ed believe jhic our minds alwayj work {and bave to worU) in. a 
deductive fashion from very general principles, 

For our kite exampie. tt Li clear Char one cannot know that "H" refers to rhie new kite without 
knowledge about the trading habits or our society. One could imagine a different world in which 
newly bought object a:e never returned to the store, but old ones art. The question we raise hew 
is how this knowledge might be represented, stored and made available to the process of 
understanding Cbarniak's story. Td see the jp«trum of possible theories consider IWO extremes; 

.'■■'': 

An Extreme In the Fiame Pi ration: 

' ■ 

The listener h!U Stored away in long term memory a fram* for birthday parties in which 

certain events and proems are explicitly reprinted. These include the problem of choosing a 

present *vith quite explicit reference to the three -or four "possible dangers to b* thought about" 

including "recipient wont like it and will be disappointed" and "recipient already has one and will 
retum it" 



An Extreme ifi :he Anti-Frame Direction 



From a strictly logical point of view it is not necessary to have a pre-mnstructed party frame 
in order to assign the proper referent to m u". It is unnecessary to have been to a party or have 
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given a present. I; wouW be sofftdent <c have stored ^ tv &eoe*al u* items of informauon 

such 41 \bew about rcturrung gifts: 

U ) <(n hat bsen bought at atore y) and {* U m good etmd i t i on 3 I 
flip I ia-s £?tor» g m !l BKCfiangi *■ fpr i) 

12} Hk is nun) and lx nag not basn useul I 
implies (x I? in goc-d condition). 



easier 



The advances of this m«hod (if it js w^aMe) art apparent; Ira memory space, 
updating, modularity, etc, The ma JOl disadvantage ii that we [hen have [0 face tne question of 
how these iwo items of information are chcsffl from :h* p,„umably very large stock of knowledge, 
presumably including many other items raLher similar to that. Ho* does the interpret* sysiem 
know which items to u«? 



to 



An even more serious problem takes the form: how don the system know rhat agfc appeal 

knowledge abo u ; the world is necessary? Are we to assume that simple syntactic rules (e.g, based on 
ward order and proximity) are never used? But if they are sometimes used, what restrains them in 
this case? 



more 



Bv providing possible answer to this question, the frame theory shows itself to be much 
than a method for. information retrieval or Vurisiic search." One such answer is chat Charmak's 
reference problem simply doe* not arise in (he sense thsr the Birthday-Present- Frame-Keeper wj|] 
already have assigned (he new kite to a slot for -returned pttf before the pronoun is processed! 
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3.4 The Demon Solution to Mte RefefoncEi Problem 

Charniak's 197? research on const: urging a model for compel ending children's stories 
predate* the articulation Crf [he frafrw idea,, but presages important notions of that concept in 
several ways. In particular. His formal realization of a fratnt wai in [he form of bafrknmltdge 
about a large variety of situation! that arise in the cmiHt of these stories. The mechanism of his 
program was for the content of sentences to evoke this base knowledge with the following effect: 
demons ("framexBepBrs" in our terminology) weie created to monitor the possible occurrence in 
later sentences of likely (bui not inevitable) consequences of the givpn ^uailon.. Thus, for our 
story fragment the birthday Knowledge creates expectations about the need For participants of the 
party to buy presents and the possible consequence of having to return these gifu. Hence, these 
demons expect the possibility of Jack already poisessLng the present and the resulting need for 
Janet to return tt, where St is known to be the present. 

It is interesting to note that this demon about presents cannot be applied unlets several 
common sense inferences are nude. For example, the demon might be a procedural representation 
of the following fact: 

[f S person PI doe& not ua*t a present X p thtn 
he nay request the giver P2 to return the pre&ant. 

For the demon to be applied, the antecent, i.e. that PI doe? not want the present K h must be 
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triggered by an assertion in ;he database being created to represent the meaning of the itory. (The 
database may be thought of as containing the kernel sentences of the story w,th each word mapped 
into in proper «n« and reference problem* disambiguated). Bui ihe story does nor inelude 
explicitly 411 important facts. Look bat* at the KOry. Seme Kaden w ffl br surprised to note thai 
the text irseir d Mi rot state (a) that the presents boughr by Penny and Janet were- Tor /act, (b) 
that the top bought b, Janet was intended as a prtH! „i, and < c > [hat havmg an obje£[ impU „ [ha , 
one dcej not want another. All of ;he above facts are inserted into the database by other demons 
made activated by the btrihday frame. Thus, CharniaVs notion of a demon 1* particularly 
satdfying because not only do demons handle reference problems, they are also used to insert into 

the story database various reasons, motivations and purposes not exphetl, stated. 

I • 

In the current view n f Tram* theory, chU insertion Of unsaid information a&sumes a 
paramount role. From thjj. point oF view, it i 5 not accidental that so moth is left out of the story. 
Rather the typical situation in comprehension is to be fjeed with a set uf clues that evoke a rich 
and detailed knowledge structure, the frame, that supplier :hr umra™ details. Naturally, these 
default* may be mapproprtaw for some situations and. in those cases, the text musr jupply the 
exceptions. Bur in general, reference, deduction and comprehension are made possible only by 
knowing a great deal about the station being discussed. 



3,5 The Virtue of Particular Knowlad ga 

AI is a young science and the ideas thar we have discussed are under constant development, 
elaborate and change. However, tt has been our pal a give the reader a feeltng for the kind of 
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enterpriM [hac AT is rather (ban a final statement on iu (h«ore:ki| f 



orm . 



A central point hu been the fuhim In *hich many kind* of knowledge play a role in 
cognitive process**. In partjeutar *e con S |de„d for ,he referent problem the need for keal 
World versus Syntactic Knowing* and. more profoundly Fibular verius General Knowledge. 
A major pojnt of our anilptt, j,e. of the Knth^e as oppwed to Power iheory of intelligence, it 
thu J>«« C Hbr ftum/afp tan often do the job for which G-W jnnufa^ is traditionally 
thought io be neemsaty 
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4. Sell-Knowledge 

In sections 4 and i. we kavc the context of Frame Theory in order to broaden our m&ighi 
into the notion of Al as a new kind, of Epistemology, To da thii. we jhall discuss (wo concerns 
which are central So AI theories but rarely appear in more traditional [emulations of knowledge 
and intelligence: the first is the formal representation and use o{ self-knowledge a nit [he second is 
the importance of control structures. 

I 
41 Kinds of Self-Knowrtedgg 

As we shall see, an interest in S elf-Knowledge will not take us outside our concern for 
Understanding the comprehension process Indeed, we shall find that this kind of knowledge plays 
an obvious rote In allowing the system (whether computer model or individual) to answer quest ions 
about its nwn performance as well as a role in the k?s obvious situation of handling special cases 
that arise in parsing the syntactic structure of the utterance. 

■ 

There are many kinds of self-knowledge. Perhaps the simplest Is the ability to observe one'* 
own behavior, allowing the individual or program to answer elementary questions about Its actions. 
A more skillful u« of self knowledge would involve being able to take account of these 
observations and direa one's problem solving according, still a more soph<«icaied program 
might, In certain Circumstances, Utilize this self-observation to learn by debugging fault* in its 
procedures, generalizing us knowledge, or perhaps suitably special iung it to form more efficient 
techniques for particular problems We shall Llldstiate each Of these kind* of self-referential 
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abilities below in relanon to the Blocks World, a favorite mini-world for A J programs in whtth % 
one-armed robot Is asked Ifl build vanoui Jtruttuies from blocks placed an a table (figure 3 >. 



4.2 The SHKDLLJ Program and the Blocks World 

Winograd'j SHEDLLf program represented % iou* dt farte of design and an important 
benchmark tn the f*alm of natural language ctjmjjiehcnsicm by computer. In this section, we 
introduce the program briefly and in me next two illustrate several way* tn which it manifest* 

rudimentary self -know ledge. 

. i 

- 

SHRDLU is a program tha; convened such Mrnen:« as "Pick up ;he big red block and put it 
In the box" into commands iTur a btocks world problem solver The success of its comprehension 
was. demonstrated by th<2 program successively carrying our the instructions in a simulated world 
displayed on * cathode ray screen which contained a one-armed robot, a cable, a. box and a 
cofbciiQri of cubes and pyramids. 

» 

The progam u capable of comprehending sentence that reached the degree of complexity 

illustrated by the following examples. 

■ 

(I) Find the block whLch is taller than the one you are holding and put it into 
the :-.-.: 

{2} Will yiau please siitk up both of the red blocks and either a green cube or a 
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llfE ROBOT'S TJOEltl 




Figure 3 
"Fick. up t bi( h t£ „ d block." 
Proa (ttlnoflran IflJJ, p. $} T 
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pyramid. 

(3) Does the honest thing [he tallest pyramid's support supports support 
anychlng gt*en> 

In SHRDLU. there are a multitude of different knowledge structures. For example, the 
intern maintains even^hiuory Lisa recording each event, why it occurred and what it 
accomplished: numerkal eoordinaie data structure recording the dimension and location of objects; 
a symbolic database jn which assertions about the state of the- world, about global knowledge and 
about active procedure are iiored; and a Fodor and Katt st^le marker hierarchy (Rati. 19(H) 
indicating the relation between words. But, perhaps most significantly, a major part of the system* 
knowledge was represented directly as procedure thereby achieving far more Flexibility and power 
thin more traditional "declarative' representations. Th«e different knowledge packets are used in 
many ways- Do answer questions, to perform actions, eq update its knowledge. 



4,3 Answering Elementary Why Q uestions 

An elementary example of idf-knowledge is found in 5HHDLIA cap^ny to generate name* 
for assertions in Its database, events, noun groups and «her entities. With these names, the system 
can refer bat k to these entities, associating mem with times, place* and other m format um Thi* 
allows the system to answer elementary kinds of questions about its behavior. Figure 4 illustrate* 
this ability. 
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23, had you tnueW any pyrniUd >,elore you pur the green one Oil the 
liCClfi cube? 

TES t THE GREEN OWE, 

The jy&ieil deduce* f T ™ Ito memory what event is b-einj; ruf erred CO, and 
■Him relate Other events to It in til*, Th e analyst Includes ecmpJei! Lenses 
of verb sroiqra, and there are npecial faciliti* E for keeping track of the 
vatlaiiB forms of irregular iraTbfi. like "have." 

2k* vhen did you pick it upT 

WH1LS I WAS STACKING UT THE RED CUBE, K URGE RIO BLOCK WTO A 
LARGE CJLtfcN CUBE, 



A rl-iM? Is described by nan lug the n*in jjjtha.1 vbicli waft being worked on. 
15. why? 

Thfl b-ySLem reneiLLerfl lnaed Lat ely previous aentimeee LO imderatsnd ques- 
tions llhe this and the (^limine, jyrcsft, 

TO GET RID OF IT, 

By keeping track of selected pare ft of the original aubgDal tr*», the 
Byactm baa Jinm? undent Landing of It* own rmtivflS, 

2*, why did you dg Eliai? 

TO CLEAK OFF THI HED CulE, 
"That" refers C* the action merit lone J lil Che answer, 
27* why did wuu tlojir efi chat cwfce? 
Htre ¥ "that." neans che one juhf. referred to- 

TO FUT IT ON A LARGE GEEEJ5 CUBE. 
JB. iriny djld yu U do that? 

TCI STACK Ut TKE RED CUBE, A LARGE fcEn BLOCK AML A CARGE GREEN ClTBE. 
:':!■ w!"- d:,i :•.,, >.'..: L::n-: 

BECAUSE VOl 1 AStfEfr HE TQ, 
It remembern utiat it did T not how the request uas worded. 

Flfrur* 4 

ZKampliis of SHEULU'S ability to anewelr WHY qwroilbna. 

Fr<W (Hinograd 1972, F* 13) T 
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4.4 Sdf-refer-ence in Gf amntatieaf Systems 



JOTIS, 



ic manner 



A second level of ssbf-referentjal ability jn which the system «n not only answer quest 
but make decisions based or. self-knowledge, is ato illustrated by SHRDLU. This is ih 
in which the system has access td n« only the current state of the parse, but alio to the past history 
nf the parse. This is illustrated by the fashic*. in which Wtnograds program parses conjunctions. 

To parse sentences with words like W M the intern invokes a special routine upon detecting 
the conjunction which notes the last constituent formed and attempts to find a duplicate constituent 
following the conjunction. If unsuccessful, it fooks further bat* in the h.story of the parse until a 
wbitrueture can be found that symetrically appears moving the con junction. A similar capability 

H found in Woods Augmented Transition Network approach to parsing (Woods, mO}. Elements 
of this approach are discussed in section 5.1 on Controlling Grammatical Computations. 

It is Cresting to note that "and" is not handle 5 in>pl, by augmenting the basic Systemi: 
Grammar of 5HRDLU. By some standards, thb moving outside the formal.sm of one rheoret.cai 
mechanism would be unaesthetic by being contrary to Occam's * aiw . But experience has >ho*n 
that the management of diverse knowledge package! is the rule rather ;han the exoepMm tn the 
architecture of complex problem solving systems. Th, obligation (3 simply one oj clearly defining 
the Interactions between these knowledge packets 
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4,5 Debug gfn g Knowiecf^e 

I 
Recently, the Block* World haj M r Ve d as a domain for investigating wlHirto*kd$« rf a 

deeper Hind- This has been In the iVm of a system called HACKER (Susiman, 1371) whtch is 

capable of learning to 55 |v« block construction problem of the complexity that SHRDLU can 

handle. (Remember chat for SHRDLU. the problem solving procedure arr pre-programmed and 

not learned.) The learning is achieved through HACKER posting an expertise in debugging. 

It it ojr belief chat debugging knowledge u absolutely fundamental to the icqulittion and 

operation of any program as complex as human thought and language 

Although- this work was net done specifically a* part of a natural language comprehension 

system, It provide the basic understanding to enable such a system to answer questions about why 

a particular method was used h in a far deeper fashion than SHRDLU's elementary abUitici 

allowed. Knowledge of nnA o-n problem soling Jtrategiw supports an understanding that not . 

only could discus the chronology of events but couW also discuis the underlying plan* and their 

evolution through succeeding Ages of generalization and debugging, 

- 

To illustrate this, we shall develop an example from Sussman. HACKER initially might be 
compared to a small child interested ,n constructing block uructurej, but not very knowledgeable 
abou* the problem, invoked. HACKER begins with the ability to place one block on top of 
another (by executing the primi^e PUTON) and wim an elementary planning capability. A 
typical elementary goal is to construct a tower from three blocks, e* pressed formally to the program 
by che conjunction (COAL (AMD (PUTON A B> (PUTON B CM HACKER attempts to wive 
this problem in a 'linear' fashion by first planning 10 achieve (PUTON A B), and then as an 





A 




state 2; 


S 


C 


state 3i 


A 


B 



Goldstein, Pi pert fj ■ Kn&wlectge, Language and A I 

independent second goal (PUTON B C). This produces the ttquenoe of block Hates illustrated 

Trace of 1ACHIEVE (AND [ON A Bl (DN B D) 
stats 1: A B C 



?PUTDN A 9 has jus-( baan executed, 

C ;The top nf 6 hat been cleared at required! 

jCtre-requiBlte of PUTON B C. HACKER it 
I3ne-ha,npcd and irust lift blocks from the top. 



B 
State 4l A C ;PUT0W 9 C ha-g just b»an completed. 

Obviously a bug hai occurred- The Action of clewing the tap of B as a pre-iequiiite to achieving 
(ON B C) resulting Ln state 3 ha* undone the first Hib-goa.1 (ON A B) whtwe purpose was to place 
block A on block B. 

■ 

One solution to thli difficulty which AI program of the pait might ha*e attempted (and 
which Occasionally children puriue), is to return to the Original state and try some different 
construction itrategy. without attempting to learn anything from the current failure to guide fhc 
neKi attempt, HACKER contain* a ne* iind of knowledge, namely kn&wledgr of how to debug . 
This debugging knowledge know* that linear joiuitoiu are only fim approximation! and that 
sometimes interactions between the JUb-gOals nf a conjunction must be taken imo account. In thus 
particular case, HACKER rEeogrUMi that the prerequisite of one sub-goal CLEARTOP conflict; 
with the purpoie of a brcthet goal, ln such situations, it applies the debugging strategy of ieelUng 
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a reordering of the brother goals such that [he goal with the prc-requigjie ii achieved Tirit. In ihu 
particular problem, HACKER; 

(L) recognize* that the bug is due to the general bug-type of "pre-req unite 
clobber; brother goaf, 

{2} applies the debugging strategy of reordering the goals, 

■■ 

(3) edits in long-term memory the procedure Tor constructing towers of three 
blocks in take account of this, ordering requirement. 

H) records a comment to itself for subsequent problems of this kind that states 
that when building block towers of any iize, ON chains should be ordered 
from the bottom up. 

Thus, HACKER exhibits both expertise about debugging and about repair, coupled with the 

ability to examine us own problem salving goals and actions so that it is able 10 apply this 

debugging expertise ro its own reasoning. It represents a modet of skill acquisition whose 

competence ii based on self-kn&wtedge about its behavior and meia-knowledge abuut the na-ure of 
plans and bugs. 

We emphasize tbe importance of debugging si ill because a vital element of intelligence n 
knowing how to handle a vasr variety of situations. We cannot do this by knowing all about these 
situations because each real wortd situation is different. What we do need it 
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(I) knowing how a tell which »id situation u sufficiently like rhe present one, 
and 

m knowing ways to adapt - Idifcug" - the old procedure for handling ihe old 

problem to 4 n«* procedure that can deal wjlh ;hc new situation, 
i 

Th U! , syejMU like analogy, liro.larlty, ™a*phor are at the heart of the new formalism*. i (1 lht 

older methods of AC. linguistics, log(c> and psychology, these were eoha raising, hard-to-explatn 
phenomena. In the new, debugging technology, they become concrete, manipulate (yj3 „ of 
knowledge. 

Finally without a theory of debugging, our image of frame-based comprehension would have 
a fatal flaw. Understanding by using large packets or previously known Information reequires the 
ability to dtbug these knowledge structures for the particular appl.cation at hand, if the system is 
to be able to deal with navel situations. 
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concern 



We have seen the way in which Al ep.stemobgy is opening new are „ , J[udy in Jts 
For self-knowledge and, in particular, debugging *k«|. A second dimension along which AI b s 
approach !0 epistemology is opening new horizons and fundamentally differ* r«m older 
approaches Ji its attention to ^rrf jtwfKrr. Indeed the idea of control structure could scarry 
esiit until one begins to see knowledge as made up of ict^ agenu which need to be controlled- 



5J Controlling Grammatical Computations 

i .: ' . 

Tirst-order* theories *uch ai chose composed of generative grammars are constructed under 
strong aseptic™ of modularity and independent*. These auumptions are heuristic ones dial 
enable the episremolog.cal analysis to get underway, but include an expectation of a subsequent 
need to study the interacting. Controlling tonpuration by choosing between rhe relevance of 
alternative chunks of knowledge, deciding when a given approach should he terminal, and 
generally guiding the reasoning process are among the important kind* of interactions which mutt 
be eventually understood, 

■ 

When context-free grammars were fir* applied to the problem of comprehending test (rather 
than only generating grammatical sentenced it was immediately recognized ihat more structure wis 
needed tf the parsing time* were to be kept within reasonable bounds. The essential problem |, to 
provide information for choosing whether a given rule should be applied It is thu basic concern 
that character the ba 5 ,c approach of both Augmented Phrase Structured Grammars (Heidorn, 
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1975) and Augmented Transition N*ts {Woods, WQ), The ATM approach has served as the basis 
OF several important language programs arid. wt shall discuss IE further below. (See for cK ample 
(Simmons* 1973) in which ATNi are used both for parsing- into an journal representation and for 
generation of English output from fhi& representation.) 



5,2 AuEment ed Transition Networks 

■ 

The Augmented Transition Network formalism for expressing grammars developed by 

Woods wii designed m extend the power of Finite State recognition grammars. Hii first step was 

ta generalise the finite state graph to allow recursion* thus essentially obtaining the power of a 

context-free grammar while still preserving the perspicuity of i network representation. A 

recursive all is indicated by the label of an art being a state name rather thin a terminal. This. 

I 
directs the recognizer to jump to ttur state, run until acceptance, and then return control to the 

Kate pointed to by the am Essentially a pushdown store has been provided to keep track of tire 

bookkeeping involved in allowing recursion, Figure 5 illustrates a Recursive Transition Net. An 

example of recursion occurs in the interpreter's transition from state S ro state Ql by means of the 

recognition of I noun phrase. 

Woods' next step along the dimensions of adding ccnirpl was to generalise these recursive 
transition n^woiks further and provide for ooth a preference ordering on the arcs editing a gruen 
node and arbitrary predicates which act as filters and must be tatisf ied for the node to be pursued. 
See figure 6 for a piece of such an Augmented Transition Net. 
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fi Sinple Rtcureive Trans itimi Hctuwrt 
Ftoh {HMdi 19 J D. P. 392). 
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With the additional augmentation of allowing the system to maintain renters hiding 

various pieces of information found during the par,,, the ATO formalism h« the full upabilit.es 
Of 1 transformational grammar, but .ncorporate* these abiHr.es in a form that admits of far more 
efficient parsing algorithm* than those previously propped f Dr transformation grammars. Thil 

approach ha; proved successful and now serve: as the basil of W»ds Lunar intern (Woods, 137i> 

and several systems constructed by Simmons (1*73; 1975}, 

Winograd's SHRDLU also addre&tcd the control issue. Woods added control knowledge to a 
state-oriented description. Wmograd added control knowledge by pursuing an entirely procedural 
approach, H* reprinted grammatical knowledge as programs. These programs, being 
procedure, could have conditional*, loops, variable*, or recursion. Figure 1 illustrates some 
simplified programs for mo&nixtng sentences, noun phrases and verb phrases (Expressed ?u 
flowcharts rather than In Wincgrad's Programmar formalism), 

Woods preferred to preserve some dement of declarative representation oy clearly naming the 
States: Winograd preferred the freedom of an entirely procedural desertion. The approaches are 
computationally equivalent {Winograd. lflS) >nd both represent the fashion in whkh first order 
grammatical theories can he extended to support efficient computation. 

Of course, most AI theor.es are computation ally equ.*al*nt just as most programming 
Unguals can through more or less fchar be made to emulate each other. This u not, howe.tr, the 
real issue The critical question is the perspicuity of the language or representation for the 
particular applications envisioned. Woods emphasises Jhe virtues of a net description: Wmograd 
prefers expression directly as progra-s. 
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FLmJchartc of elnsililfcd, PR11CMHUB STanmars for ckhih phnaeB, vert phrase, 

mil& &«Fi tenets 

From t¥iTl&trad 197Z. P. S3) , 
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5,3 An Issue for the Future — Deterministic Parsing 

A concern for control structure raises an obvious question for the Future will Lt be possible to 
organize grammatical knowledge (including interactions with other kinds of knowledge) in such a 
way that back-up jj eliminated (without recourse to paraltelisro which is ultimately equivakni to 
back-up) at least for those sentences which one intuitively feels, do not have a multitude of 
in:p|.jiri:ii.K::ii.. 

This goal of eliminating exhaustive search represents a natural concern of the Computaimnal 
Epiitemologist. First define the basic knowledge Jpate. Then analyse Us procedural structure. 
Viewing the process of finding a solution as a search branching at choice pc?mls 4 the question 
arises: "How is this branching controlled? 4 One possibility Is to allow for backup. A second is 10 
utllii* paraikelism. A third U to have diagnostic experts :hat can guide the decision at choice 
points so that bacitup is minimized or perhaps even eiiminated- 

This evolution ha.s occurred most notably in the development of programs capable of 
symbolic: integration. An early program by (Slagta t IMS) had knowledge of a basic jet of 
integration formulas, but combined this knowledge by means of a search. Recent programs 
following Moses' design of SIN (Moses. 1967} have a deeper understanding Q f the nature of 
integration and do not require any tea re h to find the integral. We do riot necessarily expect 
language programs ro evolve to the extreme form oF deterministic problem solving that the 
integration programs Have reached, but we do expect a similar kind of evolution in thi 



ie same 
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direction. 



Indeed, insight into grammatical knowledge and knowledge representation techniques has 
reached the point where one can seriously consider a backup-free approach. The issue is not 
decided, but the suggestions are promjjmg. For example, a backup-f ree scheme hai recently been 
proposed by Marcus (L975J. He calls hjs approach "wad and see parsing*. The basjc idea a thai 
backup ii made unnecessary if the system ii provided with a, limited look ahead of only a few 
words (or phrases} and if sufficient constraint is provided by semantic and pragmasic knowledge. 
He IS Currently constructing a case-frame based parser follow^ these guidelines. 

■ 

Another proposal by Riesbeck {1975} invokes a somewhat more radical approach. Riesbeck 
thinks chat grammatical knowledge should be thoroughly integrated wiLh semantic and pragmatic 
knowledge to such an extent chat it is no longer reasonable to speak of separate components. This 
represents the extreme- position along a nasuia] line of evolution of more and more intimate 
interaction between grammatical and other kinds of knowledge. Riesbeck has constructed a- 
comprehension intern that reflects this philosophy. Words in the sentence trigger "production s" 
(called demons by Riesbeci.) whose goal is to build a representation of the sentence'* meaning 
(using Schank'i conceptual dependency structures (Schank, ffl®). Some of these productions reflect 
grammatical knowledge such as generating the expectation for a noun following the appearance D f 
an article. Others represent semantic and pragmatic information such as expecting a human 
recipient following an appearance of the word >ive - . ftjesbeck proposes that such an organizanon 
has both conceptual elegance as well ai th* ability to pari* deterministic* I ly. 

A new "conceptuar grammar by Martin (1975) also promises to generate perspicuous, backup- 
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free conversions from sentences into expressions in his semantic representation. Martin's work 
suggest the poiiLbiUty that an internal representation mjghf be closer la the "surface" Structure of 

natural language than most workers have believed feasible. 

Clearly there is an important interaction between grammatical knowledge and other kinds oT 
knowledge. A purely pipeline model in which a grammatical analysis it followed by a semantic 
analysis and finally a pragmatic ar.aly.5u is untenable. However, it is not yet clear which 
alternative among- the following- is preferable: ([J structuring the grammatical knowledge as a 
modular expert engaged in a dialog with other expert s or (2) distributing the grammatical 
knowledge with other kinds of knowledge in such a Tajhion that it is no longer appropriate to talk 
about separate syntactic, semantic and pragmatic components. Tn any *as*> it is our expectation that 
in the neat few years language systems will be constructed in which backup does not generally 
occur (except in the case of garden path sentences which are equally confusing- to people}. This is 
based on the observation ihat most procedural theories initially require backup, not because of the 
intrinsic: properties of the problem area, but rather only because sufficient knowledge to 
disambiguate alternatives has not yet been incorporated into the system. This knowledge has 
gradually been formaliied (as nur earlier discussion of work by Winograd and Woods showed). 
Consequently, we are evolving towards the Mage Where knowledge It available to make the right 
choices at each decision point 
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6. The Classical, fiomantlc and Tflattern Periods oTiiU 

■ 

6,1 A Change in Perspective 

In this essay, we hive tried to fJvt the flavor of the new perspective from which M 
approaches traditional problems in linguistics and ^piitemology. Specific forms of this general 
change in peripectlve are related to * number of theoretical issues;:' 

- Che relative Importance of procedural as opposed, usually, to "declarative" or 
"proposition al" knowledge 

- the shift away from traditional logistical concepts (axiomitizations. logical 

deduct Ion J for the representation and manipulation of knowledge to more 

I 
highly structured and particular representations; 

- liberation from the need for highly uniform transformational method; by 
allowing procedural sentries or frame-keepers'' to populate the knowledge 
'system, 

In each case we note a trend away frcm a traditional form towards a newer form. In each case 
experience has already shown us that tuch trends can go top far, and that there is often good 
reason for using the traditional forms. So, looking tack over the brief history of Al. we see a 
dialectic process with three phases which mi|ht be termed the Classical. Romantic and Modern 
Periods of Al. 

■ 

In the earliest work (which might be termed the Ciatiltti Ptrlcrl extending from the mid- 
fifties throu|h the eariy siiuLrs}, knowledge is represented in very traditional ways: for example as 
subject - predicate propositions Of as important but relatively non -fundaments I extensions of this 
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to property lists. Reasoning is accomplished through uniform techniques HKh as Kirch or theorem 
p roving- 

Typical programs Of Lhc Classical Period were often crmaied by lawmen on the grounds (hat 

"people don't think like that/ (Of course, thus reaction may be totally wrong, siticc people dcs not 

I 
necessarily knew what deep processes he und«r the consensus layer* of thinking.) However, whether 

or not these objections are right in principle, the programs of what we shall term the Pomarmc 

Period (extending from the late sixties 10 early seventies) do not evoke them nearly to the same 

extent. On the contrary, they sometimes appear to be nothing mare than a new language tor 

introspective material Striking examples arc the Evans ANALOGY program (Evans, 19&9) and 

the Winston learning program {WinsEon, IS"?0). 

This stage might be justifiably termed the Romantic Period for two reasons. One is the 

extent to which it represented an introspective endeavor, while the second is a notable tendency to 

argue for first one approach and then another as a complete lheoiy of intelligence. The period 

began WLth an emphasis on network-like systems (Qui I ban, ] f j&7) and ended wjth a (.on ccn [ration on 
' i 

the representation of knowledge in an entirely procedural farm (Hewitt, 1971; Wmograd. 1972). 

■i 

Finally, in this the Mcritm Period, we begin to Investigate more rigorou.il/ the conditions- 
under which difFerent kinds of knowledge are appropriate. Frames are extended from an 
essentially .declarative data type to a procedural system through the incorporator! of frame-keepers. 

Procedures, in order to allow debugging, are embedded in a web of declarative commentary 
specifying the purposes, requirements, bugs and effects of programs. Procedural knowtedge thus 
serves as a hails for action while prepositional know' edge provides a basis for understanding the 
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behavior of procedure* and tspeciaJLy for knowing; how :o debug or change them. 



S.2 Some Unanswered Technical issues in Frame Theory 

We have ni&cuued a variety of fundamental epitfemoiDgieal notions - frames, procedural 

knowledge, control structure — and have utilized these concept to provide a unifying rhread to 

recent Af natural language research. We conclude this lection wnh a collection of questions 

surrounding the frame idea which have yet to be answered and whkh. we feel will provide the 

focus for the next round of research in the fjeld. 

i 



Learning: 

A theory that makes extensive use of pie-stored knowledge obviously faces a critical question! 

■ i 
how is such knowledge obtained? Or, in Other wordy how does iUCh a theory allow novel 

situations to be comprehended? 

."'■•■ ' 

Reference jtnd Reasoning: 

Given that the system knows that feathers an be wed to tickle u'j& could successf ully answer 

a question to that effect), it still remains to describe how indirect reference or reference through 

i , 

deicMfJUOn is accomplished- Presumably, there are some tickling instrument* that we have never 
before considered, bur that we nevertheless are able to understand In thai role. How are such 

inferences made? 



T3.il iuil:ies with ^^rr-ft. 
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1 
There art a variety of technical problem* that arm* in implementing the idea nf a sentry as a 

demon. These include handling conflicts {i.e. situation* where more than we demon ji triggered 

by the current situation and each has an alternative meaning^ forgetting (i.e. when should a 

demon become inactive), and COiltcx [-switching (i.e. how are leaving, returning and combining 

contexts handled). 



Representation: 

Fn manjf ways, the frame idea has served as a metaphor for us to present various ideas about 
the representation of knowledge - bo;h declarative and procedural. However, there are many 
representation question? that must be resolved in formal Lung the notion. For example, how 
homogeneous is our representation of language to be? Are smtrLes {demons.} to be represented in 
the same form as the basic frames? Are current computer languages such a: Lisp adequate or do 
we require new programming constructs? 



Control: 



Is -knowledge of control implicit or explicit in a frame system? Control is explicit when in 
addition to basic facts the system also includes knowledge about when to apply these facts and how 

to resolve conflicts- Control thus becomes just another kino of knowledge to the system. On the 
Other hand, control is implicit through the representation of knowledge as procedures., with 
ordering in the program, cond.jriDr.nh and calls determining the sequence of application. Which 
approach is to be preferred? 

Level interactions: 

We have posited the existence of frame-based knowledge for verbs, nouns, contexts, and 
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themes. Technical queitians arise about how these vjnou.5 kveli of knowledge interact, h the 
organization essentially httrartMtal, with each level baiting only at the resulti of the interpretation 
Of the level below it. Or do all levels enacunfi the JUrfaEE I ing.*aiitlC jnjjUL (perhaps in addition to 
examining each other)? h communication oncasjonally fteleTartklcGl wjjh higher Levels sometimes 
directing the anitysJi of Sow*r levels and at QTher Limes being Jn turn directed by the remit! of the 
iovser level analysis,? 

This represent the conclusion of our theoretical dtomnism or Ai as a new style D f 
epistemology and its impact on the understanding of language and comprehension. In the next 
section, we consider the relevance of these ideas to education. i 
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1 > The Impact oT i^rf iTicial InbstiigepjiE an Education 

7.1 If the Machinal Get Smarter, Will the Children Get Smarter? 

The relevance of Artificial Intelligence tp Education tart be discussed along swo different 
dimensions corresponding to the niittow and broader viwi of the subject as "machine intelligence" 
an4 as "epistemology 1 '. The former view evokes [he prospect of more useful machines, sometimes 
described under the label "Intelligent Computer-Aided Instruction" (Brown, 1975). The other vie* 
has possible consequences of a much deeper sort, for it can be taken as projecting a total 
restructuring of the content and the theory of education. But even if this radical prediction is not 
fulfilled, the cognitive sryk of Anginal Intelligence has very immediate and realistic suggestions 
about what children should be taughc and how. 

These two views can be stared in a more dramatic wajr. The crganiiers of the HUMRO-N5F 
conference on "Ten Year Predictions of the Impact of Computers on Education" (HUMRO t 1975> 
chose as the theme of one session; If the machines get smarter, will the children get smarter?*. 
We think the deeper question is this: "Suppc.se that mart machines eah be constructed. Should 
educator* look to the machines themselves as instruments to make kids smart. Dr should they rather 
look at the principles which led to the machines becoming- more intelligent to see whether these 
principles Could be applied to children?" There is, of course, no need to choose. Our intention in 
so posing the question is to draw attention to the richer, hui almost entirely neglected, second view. 

This paper is not the place for a full dtSCUJllon of the relevance Of Artificial Intelligence to 
Education. Instead, we shall develop briefly three representative issue*- 
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<]) the impact on education of she development of ccmpuiej-based tutors with 
reasonable 5angua|e com pretention ^binrLrrs. We ihalt develop thti topic 
under the heading Intetitgttu Cmfiuur-Aidett i Attraction (iCAl}. 

{2} th« deeper question of vahethsr she concent of A[ theories nf language can 
have an impact on the very lubstanoeor linguistic studiei m the jchooli, not 
just in the nunne* in which the currant material is taught. We have chosen to 
Call this, topic The Gfoti Boy Approach Id Ul\g ttfrje Studttt. 



(D wperher the basic goal of AI in jet-Ling unified theories of knowledge 
be expected to have a profound impact on the entire educational curncul 



i-nn 



Lin, 



with the effect of decreasing the currently accepted division of education into 
disparate subject areas. We shalt dimisi this topic under the title "The 



Articulate Leaner". 



Again we remind the reader that our duoution is intended only to five the flavor of * few 
kinds of interaction that Artificial imeSligenw *nd Eduction mifht have. For a discussion of 
related areas of dearth In psychology, linguistics and AI. see (Winograd. J976>. For a deeper 
dlseuuion of the particular interact™* dbcuued here, see (Mmsky, P^rt. 137*- Papert. 1972: 



Goldstein, 1974), 
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7:2 Intelligent Computer- Aided Instruction 

The question cited earlier "If the machines get smarter, will the children get smarter?" 
immediately evokes the commen,! "certainly not autornatLCallj. - . This follows rrom [lie aeep-er 
OOservarion that the question is not primarily aboui machines, but rather about lhe nature of 
human intelligence and the effectJVencii of teaching. To at this, transpose it into the two sub- 
questiont 

(I) "Could an idealued personal human tutor make a kid smarter?" and 

f£> "Could all or seme of the functions performed by this tutor be mechanised.?". 

Answers to the first question will vary depending- on what one thinks about controversial 
issues in the theory of {human) intelligence. But there would certainly he less variation in answers 

to the question: "Could a highly LnteLligent, empathetic personal tutor enhance the intellectual 
developmenf of children?" {Whether this enhancement is to be described a* increasing the child's 
intelligence or the effectiveness of hLs use of it ii not germane here.) 

There are very few programs that can lay claim tn fulfilling the functions, of an intelligent, 
insightful tutor. However, important ground- breaking- research has been done by Carbonell (tSto) 
and Collins {1975a) working on the SCHOLAR Project, and by Srown {ffii) m his development of 
the SOPHIE program. The SCHOLAR Project has developed a senes of successively more 
sophisticated computer-based tutors for such domains as geography. The project has involved the 
development of natural language comprehension modules, the coniiru.ej.ian of domain -oriented 
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problem solvers baled an 4 semantic net approach to representation, and, most notabty, the 
procedural representation of tutorial stra^gtet. 

■ 

Recent war* bjf Collins (W?5b> hai involved a deep interaction between pjychology and 
Artificial Intelligence in the form of studying the tmcts Of reasoning thai people typically use to 
handle questions in #p<n domains, i.e. concerning topics about »hjch they knew a. great deal but not 
all of the relevant facts. People are typically quite good it mating plauiible inferences in su<h 
contexts. Collins 1 goal is to represent such reasoning strategies in a procedural way and then make 
that knowledge available to the SCHOLAR tutor for the purpose of allowing the tutor to help a 
student debug his own reasoning stj-k. 

Brown (]&7*J has developed a computer-based la bora tor j in which a student can learn 
electronic troubleshooting. The program SOPHIE covins an electronics problem solver that 
embodies a genuine understanding of the circuits Involved, *' natural language component that 
allows Engliih question-answering t occur between the student and program, and an 
understanding of Certain kinds of tutoring Strategies-. We shall not develop the detaili of Browns 
program further, exc*p; to remark On the following points: 

{\> the program Is 1 milestone in Computer-Aided Instruction <CAI> simply because of the 
extent to which it understands its subject matter. Previous CAI programs (and indeed 
most current ones) are notorious for their ignorance. The slightest deviation from the 
especced response on the part of the student in hts answer results in the program 
erroneously Judging the answer as incorrect. Brown's program utilizes a powerful 
simulation of the circuits under study to accurately predict the consequent*! of various 
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faults. This Imd Of domajji <:&mp«entp is essential as ft foundation for any CA1 tutor, 
and we expect (he incorporation of A[ problem solvers Lnto CA] program* to become 
mare and more commun in the coming years. 

(2) the tutoring strategies of SOPHIE are concerned with the nature pf reasoning 
involved in building up a model of a problem from incomplete evidence. Thus it a 
fundamental kind gf human reasoning. ]t addresses the basic issue of what kinds of 
questions a student should ask in attempting to understand a nrw subjett, device or 
electronic circuit. SOPHIE is able to judge! whether a measurement requested by ihe 
student (a* part of the proceii of tracking dawn a fault in the Circuit} is a reasonable 
question on ihe ha sis of what the student aireid/ knows. If not, i.e. if the answer to the 
question is already entailed by information previously obtained by the student, then the 
program draws the student's attention to the redundancy and discusses the information 
which the student knew but faukd to consider. 

■ 
■i 

This study of human cognitive! processes Through the formalism* developed by Artificial 
Intelligence and the incorporation of that understanding into CAl programs Is only in the 
formative stages. But it is our hope that in th* coming years, the concept of personal computer- 
based tutors availabte to any student at any timE wrili become more and more of a reality. 



* 



7,3 The Glass Box Approach to Language Studies 



In the previous section, we sa* the application of Artifidai Intelligence research on language 
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comprehension and problem salving to the deugn of Intelligent CA1 programs. In this icctjon. ^s 
develop a more radical Application. 

I ■ 

The ICA3 tutors, no matter how intelligent, are euenciatly bl&tk boxci from the student's 
perspective. The possibility exists thai the concepts devetoped to design those programs might 
themselves be worthy object* of study by the students. We propose calling this a £*«Jf fre* 
approach, Unhke the "blatfc box" approach, the point here J* precisely to encourage the young 
students to look inside the box, to as* how jr works and to gain insight into their own language by 
so. doing. In using the phrase "Glass Box*, we follow Pwlsi (1*7-1) discussion of APL programs 

whose structure is available to the student to be studied and understood. 

■ 

i 
"■"■.. + 

An example Of the glais ba* approach is illustrated by SUggeitmg a new kind df Language 

■ i V ' 
Laboratory. We envision a computer-bated environment m which students, would have the 

opportunity to explore the structure of language by designing;, programming and Testing various 

kinds of language comprehension program!. 

The virtues of such a laboratory approach lie in the active nature of the student's 

involvement, the possibility of the student enveloping his ideas in a personal way not rigidly 
limited by the teacher's approach (but reasonably circumscribed by the goals of designing a 
successful program), and the exposure to new computational theniies of language that are in many 
ways, superior CO traditional grammars as conceptual frameworks for the study of language at 
school. 



As an example of the power of these new approaches to language, we mention the possibility 
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that they hotel out for a unified approach to the s'uay of knowledge. This was illustraTed earber 
by our dlttUULOll Of Frame Theory in which Ihr Lame basic knowledge-representation technique* 
w«e seen to apply to grammatical a now ledge as welt as other kind* of knowledge concerned with 
themes, context* and semantic meanings. The exciting poisib-ihty exists of diminishing the 
comprtmerita lization of school knowledge. Linguistic knowledge becomes less of a "different" 
thing, apart from other knowledge. 

Our series of roles for language understanding machines ha* moved progXttiSi Vely away from 
a focus on the machine to a focus on the linguistic ideas which make the machine possible. We 
conclude this essay by genera hung this approach beyond the linguistic domain. 



i i 



7.4 The Articulate Learner 



In thii section, we develop further the notion of a Unified approach to educational curricula 
by introducing the- notion of Arttculatt Ltarntng. Our discussion j* motivated by the question of 

l 

Why students who find language Studies in school challenging and who do well in them often find 
the mathematics curriculum a confusing and impossible maze. A traditional answer is, that there U 
such a thing a* quantitative mathematical abjhties and qualitative language-oriented abilities. 
Such a belief Is mirrored in alt of the Achievement Tests that student* are asked to tale in which 
there are clearly demarcated language and mathematics section*. 

We should like to raise the possibility chat the traditional mat hematic* curriculum of grades 1- 
12 lacks certain dimensions which na*e it understandably obscure compared to language studios. 
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In particular, ill baste falling Li in the extreme difficulty which it presents for a student or teacher 
who attempts to be articulate about the problem solving process. 

in language &udie?, a teacher cart discuss the many dimensions nf a srory written by a student: 
ill grammar, its theme, its narrative Structure, the plot, the seEting, In centrist, what can A teacher 
tiiscuLs about the failure of a student to solve an arithmetic problem or an integral? For example, 
we Kave seen a child go to the blackboard and solve an arithmetic problem in the following way: 



3S 
+ 35 

EI 



The teacher involved had little to say besides observing [hat the answer was wron£. We 
believe that the earlier discussion of M and Epistemology has some very definite suggestions for 
altering the mathematics curricula to allow the student and the teacher re be far mare articulate 
about the knowledge and problem solving invalved. 



For the above example, we imagine an environment tn which the first kind of knowledge that 
a student is exposed to is the struuure of programs and the related ideas of planning and 
debugging. Skill in designing procedures lies at the heart of achieving competence in mathematics 
(and m writ Lug essays as well). Given access re a vocabulary for programs, plans and bugs. w e 
believe that Mud en t and teacher could be articular about describing the particular algorithm used 
by the student in reaching the "Bit)' answer, in identifying the bugs, and in debugging the addition 
program to yield a correct result. For the above example, we imagine a discussion similar to the 
earlier description of the HACKER program's reasoning regarding Its problem solving in the 
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Blocks World, The student had mauered an addition procedure for single digits. Hit goal in Ihii 
problem ii to apply that procedure as a titi-muttm to the more ccmplejf goal of adding multi-digit 
numbers. His solution Indicates that he folfowed the reasonable ttnwr £te« of assuming that |he 
sum of multhdigit numbers could be achieved |>y adding the columns tndtprnrfmtty. Linear 
planning commits trie- problem solve: to have iome method of putting together the solutions of the 
sub-problems w-bkh were arrived ai independently. The answer of *olCT indicates (hat our student 
utilised the simple irtifite of concatenating the ;w a partial solutions. The bug. at course, was in 
assuming the complete independent of the column addiiions. The drugging is accomplished by 
raiting account of this interaction through ordering the columnar additions rig ht-to- left and 
Ullliiing a "carry" tima UTiiitUTt. 

The reader may find a vocabulary involving such terms as linearity, bug, debugging, 
independence and data structure confusing; and must certainly believe that such a vocabulary is 
hardly useable- with an elemeniary school student. This is true, of course, given the current 
curriculum. However, our proposal is not Simply to add this vocabulary to the traditionat 
curriculum. It is more far reaching. The proposal is to redesign the curriculum to allow 
introduction of this and related concepts at an early stage, 

This is being done in a preliminary way at to. IT. 1 * LOGO project (Papert, ]$?2) in which 
new com pupation -based approaches to mathematics, science and music are currently being 

developed. As an example, elementary school students are introduced to the above set of 
programming; concepts and to rradnonal mathemaiical concepts in the contexr of programming a 
computer display to draw pictures of their choice (figure 3 ), 
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Figure e 

Typical Pictures IHcavtj by ChlldrSA 

FTOffl (Goldn tela 1974. Pp. 5 irad 21), 
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The children construct (he programs cheroselvcs in LOGO, a programming l^n^u^e 
espressly designed for students. A succeuful LOCO experience involves not wily the student 
being self-motivated to constr.ua such prog rims (because of the enjoyment pf creating pictures), 
but also involves discussions between mjden; and teacher, between student and Student, and 
between the student and himself. Ln which the student becomes verbal about his reasoning and 
learns to use a vocabulary describing hn plans, nil progjams, and, most importantly, his. bugs. 

The design of new educational environment! m which the student is introduced to the 
concepts that AI has found essential to the construction of intelligent programs, is only in a. 
beginning state, But our experience with childien in teaching them Turtle Geometry, Computer- 
based Physic*, and Computer-based Composing lead* us to believe that the approach is promising' 
and potentially revolutionary. Furthcrmare, it seem* w lead to a situanon where students untie 
articulate about their own prabiem solving, regardless of whether it is in the mathematical or 
linguistic domain. 



Zj^E pistemolagy gndj Education 

1 ■■■■■ *" ■* 

The role of a *now ledge-oriented A] approach to the study of language comprehension has 
its counterparts in all areas of cognitive science, and each of these carries implications for 
education. 

We have argued again and again that a major thrust of AI research has been to find 
representations of knowkdge which lend themselves to effective integration into larger knowledge 
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systems. The analogous problem for education is obvLDiii: Art the representations of knowledge 
which hive proven them sel v es effective in augmenting machine intelligence also of value in 
augmenting human intelligent? And, if not, does a sunilai inquiry into the forms of knowledge 
suitable for human thinking yj«]d. a different set of modes of representation? Futhermore, the 
epiitemolo^ical paradigm for the study of learning asks questions like: What knowledge docs a 
good learner use in a given bunion? What knowledge would mate learning a simpler process 
than ic appears tq an observer who does not know this knowledge is being used, or to a. learner 
who lacks it? 

Very little attention has been paid CO problems of this iort, Hut signs of a rich area of the 

utmost importance seem to be present in the results of such preliminary investigations as the 

construct ion of Intelligent Computer Aided Instruction programs containing procedural models of 

human inference and learning, the design of new approaches to learning based on Debugging 

Theory,, and the design of new curricula such as Turtle Geometry based on a nonon of the 

Articulate Learner. 

■ 

It ii our belief that future explorations into the nature of language. Df various subject 
domains and of the elements of intelligence itself - if pursued from an en istemologica.1 perspective 

grounded, in the computational metaphor ■- will provide the foundation for deeper incomes of ih? 
nature of language and of the nature nf learning Furthermore, we expect such theories to have? 
practical consequences, in the form of program! that comprehend language and in the design of 
new kinds of computer-based educational environments. 
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Epilogue "CpnlTQrGt'aiKS about EduaataiUh 1 and Tnnat&rtea-s 

The genr-al issues about the Theory of eduction discussed in iection 7 cut across a number of 
acute current controversies about the i-«la:icjns. between linguistic abilities In the developing child 
and other cognitive functions including learning. Different approaches to linguistics and to 
cognitive theory appear to lead theorist! :a very different opinions or nueitloill such as the 
dematn-tpttiflctiy of language skills; then relation to particular brain structures; and the extent of 
their inn&tfnt$). For example, one sees a very sharp opposition between the Chamskian and 
Piagetian lines of thinking (Chomsky, ]975). Fo: the former, there is a definite Ungiautc 
compel f net " which U extremely specific to language <i.e. has very little overlap wuh Other cognitive 
abilities) and which is determined by innate biological' structure*. Adherents of the Piagetian 
school h on the other hand, tend to see linguistic competence as sharing major cognitive processes 
with other intellectual domains and ;o be explicable more In terms of a psycho-developmental 
process. 

Any such brief summary of complex theories necessarily over's imp iifies the respective 
positions., Nevertheless, the oppositions, though mote modulated, are real and of fundamental and 
urgent importance. We see work in Artificial Intelligence as having 3. very direct bearing on thu 
debate. This does not mean that we are anlc to rtnlvt the issues. Rather we believe that 
amputattimal mdtii provide a thec-raticai overview necessary even for a clear statemenr or the 
issues- This belief is reinforced, by the increasingly rapid spread of computational ideas into the 
work of psychologists and even philosophers whose views fall on both sides of these controversies. 
For example, see (Fodor, 11375) for the computational formulation of an extreme "nativist" position 
by a psychologist who disagrees fundamentally w;'.h much of the "main-line" Al trends. See 
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<Norman, 1975; Bobiow. L975) for Che view* of plyehckigLlli sympa th-euc to the tind of At theory 
treated in rKJS paper; 

In tGnduiion, though She debate* about Intelligence and co|ni[it>n rage both in and out of AI. 
and the deader may have objected to many of che specific o-b&ervaiicns that we have made 
regarding frarric theory and Other jjsuei. we belike Lhal there already hai been Lhis f undament*! 
slid profound change; that the language of computation has become the proper dialect for 
discussing rhe banc issues at both psychology and education. 



i 
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